Tape data management

ABSTRACT

Systems and methods for managing data with respect to tape storage are provided. The system includes a data manager for receiving metadata related to content and for generating a proxy file system which mirrors the data structure of the content data. The system further includes a server coupled to a tape device, the server receiving the content data directly from a data source and providing the content data the tape device for storage on a formatted tape media.

CROSS REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of the filing date of U.S. Provisional Application No. 61/411,816, filed Nov. 9, 2010, entitled “Tape Data Management,” the disclosure of which is incorporated by reference.

BACKGROUND

The present invention relates generally to systems and methods for managing data with respect to tape storage, and more particularly to managing data with respect to data stored on tapes in a manner that supports a file system.

The rapid expansion in the use of digital media and the ease of communication using such have significantly increased the amount of data that media companies create and manage. More than ever, media companies desire to not only quickly generate such digital media but also quickly access, modify, and proliferate such data. This grand expansion in digital data has also seen a shift in the professional media community from recording on traditional film and tape to recording directly to digital media files. This trend has numerous advantages in speeding up the process of the shooting, editing and generally the creation of the movies, television and news programs. However, the tremendous amount of data may be difficult to preserve, edit, and access.

SUMMARY OF INVENTION

In accordance with an embodiment of the present invention, a system for providing tape data management is provided. In one aspect, the invention provides a method, performed by a computer, for storing information on tape, comprising: receiving stored files and information of a modification time of each of the stored files; commanding writing of the stored files to tape; writing a proxy file for each of the stored files to memory, each proxy file including an indication of the modification time of the corresponding stored file; and commanding writing of the proxy files to tape.

In another aspect, the invention provides a system for data management, comprising: a processor to receive metadata related to source content and extract information from the metadata to determine a data structure for storage of the source content; a server, coupled to data manager via a network connection, to receive from the processor the metadata and the data structure for storage of the source content and to receive the source content from a client and to execute instructions to cause the content data to be written to a formatted tape media of a tape device coupled thereto.

In another aspect, the invention provides a computer implemented method for managing data, comprising: receiving via a server, content data and metadata information, the content data being directly provided from a content data source; initializing a proxy file system; executing instructions to cause the content data, write sequence information and the proxy file system to be sequentially written to a formatted tape media.

In another aspect, the invention provides a computer implemented method for restoring content data from a tape, the method comprising: retrieving, from a formatted tape including content data, a binary file, the binary file including instructions which when executed cause a processor to: read directory structure information from the tape; and restore the content data included on the tape to a predetermined target location.

In another aspect, the invention provides a computer implemented method for self-verification of content data on a tape media, the method comprising: retrieving, from a formatted tape including content data, a binary file, the binary file including instructions which when executed cause a processor to: perform a checksum on the content data on the tape; compare the checksum with a correct checksum stored on the tape; provide an indication of a data verification result.

In another aspect, the invention provides a computer implemented method for generating a preview of content data stored on a tape storage medium, the method comprising: receiving content data from a source to be written to a tape storage medium; extracting metadata from the content data and using the metadata to generate a proxy data structure; copying a file included in the source content data and storing the copied file in the proxy data structure; writing the source content data to the tape storage medium; storing the proxy data structure in a storage device of a production station; accessing, via the production station, a file of the proxy data structure to generate a preview of a corresponding source file of the content data written to tape.

Other features and aspects of the invention will become apparent from the following detailed description, taken in conjunction with the accompanying drawings which illustrate, by way of example, the features in accordance with embodiments of the invention. The summary is not intended to limit the scope of the invention.

DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram illustrating a tape data management system in accordance with aspects of the present invention;

FIG. 2 is a diagram illustrating a process for tape data management in accordance with aspects of the present invention;

FIG. 3 is a diagram of a physical tape layout in accordance with aspects of the present invention;

FIG. 4 is a diagram illustrating a tape file system in accordance with aspects of the present invention;

FIG. 5 is a diagram illustrating a process for tape data management in accordance with aspects of the present invention;

FIG. 6 is a flow diagram illustrating a process for syncing data over multiple tapes in accordance with aspects of the present invention;

FIG. 7 is a flow diagram illustrating a process for recovering, or restoring, information on a tape using an executable program stored on the tape itself in accordance with aspects of the present invention;

FIG. 8 is a flow diagram illustrating a process for remotely restoring content from a tape in accordance with aspects of the present invention;

FIG. 9 is a flow diagram illustrating a process for performing a self verification of the contents of tape media in accordance with aspects of the present invention;

FIG. 10 is a diagram of a production pipeline for capturing and storing content in accordance with aspects of the present invention; and

FIG. 11 is a diagram of a post production pipeline in accordance with aspects of the present invention.

FIGS. 12 and 13 are block diagrams illustrating systems in accordance with aspects of the present invention.

DETAILED DESCRIPTION

FIG. 1 is a diagram illustrating a tape data management system in accordance with aspects of the present invention. Referring to FIG. 1, a tape data management system 100 includes a server 120 coupled to at least one tape drive 125 and at least one client 130, with a plurality of tape drives and a plurality of clients as illustrated in FIG. 1. The server may be directly connected to the tape device via a high speed connection, such as fibre channel or serial attached SCSI (SAS) for example. Alternatively, the server may be connected to the tape device via a network, such as a storage area network (SAN). The tape drives preferably include capability for a file system such as a linear tape file system (LTFS) for example. In some embodiments, the content may be written to an LTFS formatted LTO-5 tape. Other tape types may alternatively be used in various embodiments. The clients may be coupled to the server by way of a local area network or a wide area network, and may be or execute on personal computers or other workstations or other computer devices. In some embodiments, a tape drive may be coupled to a client device with the client commanding tape drive operation, and in such embodiments, the client device may perform functions ascribed to the server herein.

The server commands the tape drives to write data to and read data from tapes in the tape drives, also generates additional information regarding data written to the tapes, with the additional information in some embodiments also written to the tapes as well. The additional information includes, in most embodiments, modification time information indicating a modification time of source data written to the tapes. In some embodiments the additional information includes various items of metadata regarding the data written to the tapes.

In some embodiments, the source data is in the form of files, and the additional information is stored in the form of a proxy file system, with a proxy file for each file written to tape. The proxy file may include, or otherwise have associated with the file, information that indicates a time of modification of the source file. The proxy file may also include other information, such as image information (in some embodiments at a lower resolution than mage information of the source file written to tape), format information, creator information, project information, or other information. In addition, in some embodiments, proxy files are maintained in a directory structure matching, or corresponding, to a directory structure in which the source files are stored on the tape or stored on a source memory device. In such an arrangement, the proxy files and associated directory structure may be considered a proxy file system.

As illustrated in FIG. 1, the system includes a data manager 110, which is optional in various embodiments. The data manager may serve as a central policy manager and be comprised of a central database, a web engine, and policy code. The clients and server register with the data manager. The data manager monitors connectivity with the clients and server using for example periodic signal, such as a signal that may be referred to as a heartbeat signal. In some embodiments, the data manager may synchronize a job or data transfer that is executing on a server, a client system or both. Further, the data manager may provide a web interface to enable users to add, edit, view and monitor data transfer jobs remotely.

When source content is to be written to tape, the source content metadata may extracted and provided to the data manager. In some embodiments, the source content metadata is not processed by the server, instead the data manager is configured to intelligently read the metadata. That is, the data manager parses the metadata to determine the file structure of the content data. The data manager stores the extracted metadata and provides a copy to a server that will conduct the data backup of the source content. In other embodiments, the server is instead configured to determine and/or generate the metadata, and place the metadata for example in the form of a proxy file system, which the server may provide to the data manager. Provision of the proxy file system to the data manager is beneficial in that potentially the clients, or other servers, may access the information of the proxy file system from the data manager.

FIG. 2 is a flow diagram illustrating a process in accordance with aspects of the present invention. In some embodiments the process of FIG. 2 is performed by the server of FIG. 1. In some embodiments, a client of FIG. 1 performs the process of FIG. 2, for example if the client is coupled to a tape drive instead of or in addition to the server. At block 210, the process starts. At block 220, the process initializes a proxy file system, which may be a proxy file system as discussed with respect to FIG. 1. In some embodiments, the proxy file system is initialized to include a file for each of the files to be written to tape, with in some embodiments, the files arranged in a directory structure mirroring a directory structure by which the files are to be accessible or visible on the tape. In some embodiments, the proxy file system may be constructed to include a mirror of the native data structure for the source content that is to be written to tape. In some embodiments, the proxy file system may be configured to only include a mirror of the source file structure

At block 230, the process writes a file to tape. The writing operation preferably is accomplished in accordance with the Linear Tape File System (LTFS) Format Specification, for example version 1.0 or 2.0.0, the disclosures of both which are incorporated herein by reference for all purposes in their entirety, and both of which at the time of writing are publicly available from IBM.

The write operation may be performed via a local connection or via a network connection (e.g., LAN or WAN). In some embodiments, the system uses a very small and temporary disk cache. It uses this disk cache to perform the parallel sync used to achieve WAN optimization.

In some embodiments the system uses buffer resizing to calculate the size of the send and receive buffers. These buffers are tuned by using the available bandwidth and the latency between the source and target. The calculation for buffer sizes are described as follows:

-   -   Send buffer in Kbytes—Bandwidth (Kbytes/sec)*Round trip latency         (secs)     -   Receive buffer is ½ the send buffer

In some embodiments, the system may use a retry algorithm. Since networks often drop connections that do not respond for an extended period of time, the system may be tuned to recognize this as normal and resends a sync request to ensure that a sync is proceeding. This may be useful in avoiding timeouts due to an extended tape seek, for example. In some embodiments, multiple retries maybe sent before determining that a connection has been lost and closing a connection.

At block 240, the process stores the proxy file system information. The process may store the proxy file system in memory associated with the server, though preferably not the tape as doing so may delay progress of writing of the source files to tape. For each file written to tape, a corresponding proxy file is generated. The proxy file includes the modification time of the file written to tape. In various embodiments the proxy file additionally includes attribute data related to the source file written to tape, such as, file size, extended attributes and metadata. In various embodiments the proxy includes some information of the source file, for example when the source file is an image the proxy file may additionally include an image of a lower resolution than the image of the source file.

In accordance with aspects of the present invention, the proxy file system may be stored on the data manager, the server, or a client system or a combination thereof. At block 250, the process stores write sequence information. The write sequence information is recorded and maintained so that the written content data may be read back in the same sequence that the source content data was written. As such, seek operations may be minimized and the speed at which the data may be read is increased.

Additional information including, for example, an archive history which includes a history of the content written to a tape may also be generated and stored. Where multiple tapes are used for an archive, the archive history may also include identification for the tapes used for the archive and the location directories each time content was synced to tape for the archive.

At 260, the process determines if there are more files to be transferred. If there are more files to be written, the process returns to block 230 and repeats. On the other hand, if there are no additional files to be written, at block 270, the process writes the proxy file system to tape and writes the write sequence information to tape. The proxy files system and write sequence information may also be written to disk on the data manager, the server, or a client system. As such, an indication of the contents of the tape, and in some embodiments information regarding or of the contents of the tape may be viewable and searchable from a remote system.

FIG. 3 is a diagram illustrating a physical layout of a tape in accordance with aspects of the present invention. The tape includes a first wrap 301 a, a second wrap 301 b, and subsequent wraps to an end wrap 301 n. As shown in FIG. 3, which is not to scale (and in which the left-hand portion may be considered a first end of the tape, a center portion an opposite second end of the tape, and the right hand portion a return to the first end of the tape), a first wrap of the tape (forming a first partition) includes an index, Index 1, of the contents of the subsequent wraps of the tape (forming a second partition). As illustrated in FIG. 3, the second wrap includes additional information, with files labeled Clip 1, Clip2_1, Clip 2_2 and Clip 3 written in succession on the tape. The files, for example, may have video clip information, and each file may be considered a video clip. The video clips are followed on the tape by proxy file system information, which may be stored in a single file or multiple files, and a file including write sequence information. When the proxy file system and write sequence data are maintained on disk during content mirroring, the content data, proxy file system and write sequence information may be included in sequential blocks on the tape. Accordingly, subsequent retrieval of the data on the tape may be optimally performed. The remaining tape is empty and available for subsequent data syncs.

In some embodiments, the data manager, server and/or client systems may be configured to execute instruction for generating and displaying a listing of files in a proxy file system displaying an image, for example.

FIG. 4 is a diagram illustrating an example file system listing for an example file system in accordance with aspects of the present invention. Referring to FIG. 4, the file system is organized in a directory structure. The directory structure includes a directory for content data included in a write sequence. Further, a subdirectory may be provided to capture incremental changes to the content data. The file system listing may also include additional information regarding the captured content data such as the date and time the content was created, modified and/or when the content was written to tape.

FIG. 5 is a flow diagram illustrating a process for updating information written to a data tape in accordance with aspects of the present invention. In some embodiments the process of FIG. 5 is performed by the server of FIG. 1. In some embodiments, a client of FIG. 1 performs the process of FIG. 5, for example if the client is coupled to a tape drive instead of or in addition to the server. At block 510, the process starts. At block 520, the process compares source file information with proxy file system information. This comparison determines whether any of the previously archived files have been modified since being written to tape or whether new files have been added. The comparison may be performed by the data manager, the server or a client system. In some embodiments, the comparison is performed using file modification time information of the source file and file modification time as indicated by the proxy file system.

At block 530, the process determines whether the changes have been made. When there have been no changes made to the files included in the proxy file system, the process terminates. Conversely, if the process determines that there are changes with respect to the source files, the process continues to block 540. At block 540, the process initializes an additional proxy file system. The proxy file system is, for example, as discussed with respect to block 220 of the process of FIG. 2, but only including any files and associated directory structure for changed or new files. At block 550, the process writes a changed or new source file to tape. At block 560, the process stores the proxy file system information and write sequence data, for example as discussed with respect to block 240 of the process of FIG. 2. At block 570, the process determines if there are more changed or new files to be written to tape. If there are more files to be written to tape, the process returns to block 550 to continue writing the additional files to tape and storing the proxy and write sequence data. When there are no more files to be written to tape, at block 580, the process writes the additional proxy file structure and write sequence information to tape. At block 590, the process returns.

FIG. 6 is a flow diagram illustrating a process for syncing data over multiple tapes. In some embodiments the process of FIG. 6 is performed by the server of FIG. 1. In some embodiments, a client of FIG. 1 performs the process of FIG. 6, for example if the client is coupled to a tape drive instead of or in addition to the server. At block 610, the process starts. At block 620, the process determines the available space on a tape. At block 630, the process writes the file to tape. At block 640, the process checks whether space is available to continue writing. If there is space available, the process continues, repeating the steps of writing the next file to tape and checking space availability. When there is no more space available on the tape for writing, the process closes the sync and at block 650 checks whether a new tape has been installed. If a new tape has not been installed, the process will wait and repeatedly check for a new tape. When a new tape is installed, at block 660, the process performs an incremental sync, for example using the process of FIG. 5, to write the remaining files to tape.

In some embodiments, the process of checking for changed files, initializing additional proxy file system and writing the additional files to tape as described above with respect to blocks 530, 540, and 550, may be continually executed as a background process performed by the server, data manager or a client to enable continuous syncing of the content data. This is advantageous because, further reduces the potential for lost content and also reduces downtime related to system data backups.

In some embodiments, the process may retrieve information regarding the writable space remaining on the tape. The process may determine an amount of information to be written to tape. If there is sufficient space to complete the backup, the process writes the information and closes the sync. If there is not sufficient space, the process determines a first portion of the data to be written that will fit in the remaining writeable space of the tape and writes this portion of information. The process may wait for an additional tape to be installed in a tape device to continue writing the remaining portion of the information to be written.

FIG. 7 is a flow diagram illustrating a process for recovering, or restoring, information on a tape using an executable program stored on the tape itself. In some embodiments the process of FIG. 7 is performed by the server of FIG. 1. In some embodiments, a client of FIG. 1 performs the process of FIG. 7, for example if the client is coupled to a tape drive instead of or in addition to the server. The program, for example, is an executable program in binary form, and the program may be executed, again for example, by the server of FIG. 1. As the program is preferably stored on the tape in binary form, the program may be considered a self-restoring binary for the tape.

The self restoring binary manages the process of efficiently recreating the point in time from the tape to the target disk. The self-restoring binary allows for both partial files and full snaphot recovery and also uses the write sequence to enable highly efficient reads directly from tape. The self restoring binary may be stored on the main LTFS volume, for example in partition 0, and may be visible when the mounted. Further, the binary may be stored in the root location under a hidden folder called .xxx_self_restore or other location and/or name according to user preference.

Referring to FIG. 7, the process starts at block 710. At block 720, the process copies a binary from tape to memory. At block 730, the process begins execution of the binary. The binary may request a user to specify a target location for the restored content or use a predetermined target location. At block 740, the process reads the directory structure information. The process may read an archive history to determine the location of content included in the archive and whether the archive spans multiple tapes. The directory information may further include write sequence information and proxy file system information. At block 750, the process writes the latest file versions to target locations. In some embodiments, the binary determines latest versions of files on the tape by accessing proxy file information stored on the tape. In some embodiments, the binary uses an overlay approach and syncs the latest content from tape to a target and applies the correct modified times which may be retrieved from the proxy file system included on the tape, to the restored content. At block 760, the process returns.

When the archive spans multiple tapes, the process may restore content from a first tape and prompt the user to insert subsequent tapes and continue the restore content to a target location until the entire archive is restored.

FIG. 8 is a flow diagram illustrating a process for remotely restoring content from a tape in accordance with aspects of the present invention. The process may be performed by systems, or portions thereof, described herein. The process starts at block 810. At block 820, the process receives a request to restore content from tape. At block 830, the process determines a location for the backup. In one embodiment, restore information may be retrieved from the data manager. The restore information may include, for example, a serial number for the tape(s) used in creating the archive, the location of such tapes, a target location for restoring the content, the metadata, and proxy file system. At block 840, the process sends the restore information to a server coupled to the tape device to be used in conducting the restore. At block 850, the process begins executing the restore. The content to be restored from tape need not be sent to the data manager, but rather may be sent directly to the target location. When the content has been restored, the process returns at block 860.

FIG. 9 is a flow diagram illustrating a process for performing a self verification of the contents of tape media in accordance with aspects of the present invention. In some embodiments the process of FIG. 9 is performed by the server of FIG. 1. In some embodiments, a client of FIG. 1 performs the process of FIG. 9, for example if the client is coupled to a tape drive instead of or in addition to the server. When content is synced to tape, a checksum of select data blocks may be stored in an XML. In some embodiments a spot checksum may be used to improve process performance. In addition, the checksum may be stored in an encrypted form for security purposes.

Referring to FIG. 9, the process starts at block 910. At block 920, the process copies binary from tape to memory. At block 930, the process begins execution of the binary. At block 940, the process performs a checksum on the content on the tape. At block 950, the process compares the checksum with the checksum information stored on tape. At block 960, the process provides an indication of the result of the checksum comparison. At block 970, the process returns.

The data management systems and methods described above are broadly applicable to many media environments. Media environments are quickly transitioning to a file based workflow—from capture, to post production, exchange and distribution. A large number of production pipelines now utilize file based cameras that generate terabytes of file based video files rather than traditional video tapes.

FIG. 10 is a diagram of a production pipeline for capturing and storing content in accordance with aspects of the present invention. The production pipeline may include a capture device 1010 including a data storage media, an ingest device 1020, a server 1030 and a tape device 1040. The capture device may be a video camera, digital tape recorder or other media recording device. The data storage media may be a hard drive, flash drive or other data storage device. The ingest device receives the content stored on the data storage media and transmits the data to a server. The content data may be received in various formats on a storage media, including, but not limited to, flash card, memory stick, compact disc, DVD, Blu-ray, or hard drive. The server may extract metadata related to the content to determine the data structure. The server may send the content data and related information (e.g., metadata, write sequence), to the tape device which may then write the content data and related information to a tape. In some embodiments, a checksum may be performed on the source content and the checksum output may be written to tape along with the content data. In some embodiments, the checksum may be performed while the content is being written to tape. In some embodiments, a second copy of the tape archive may be made and stored at a remote site for disaster recovery purposes.

FIG. 11 is a diagram of a post production pipeline in accordance with aspects of the present invention. As shown in FIG. 11, content data captured in the field may be provided to server 1110 and stored on a tape device 1120 coupled thereto. In traditional systems, the content would have to be re-ingested into edit applications before the content could be browsed. However, since the contents are stored on LTFS formatted tapes, the content may be directly accessed and browsed.

The content may be edited directly from the tape using an editing application or may alternatively be restored, as previously described, to a primary storage device such as a hard disk or a shared storage area network 1130 for editing. After the content has been edited, the content data may be offloaded and archived to tape.

During the editing process, the system enables editors to check-in and share their work-in-progress sequences and associated media. Editors may export their edited sequences using XML or Advanced Authoring Format (AAF) exports, for example. An incremental sync may be performed to archive any changes made to the content during the editing session. The exported sequence may thus be shared with other editors that may further modify the sequence, while maintaining the ability to easily revert to a prior version

FIGS. 12 and 13 are block diagrams illustrating systems in accordance with aspects of the invention. Referring to FIG. 12, the system may include a server 1210, a tape device 1220, and an offsite facility 1280. The system may also include client systems 1230, additional server 1240 and data storage such as SAN 1250. As the clients and servers generate and edit content data, the content data may be quickly preserved and written to a tape or multiple tapes (e.g., 1260 a, 1260 b, 1260 c) as discussed above in connection with FIGS. 1-6. For purposes of disaster recovery, these tapes may be sent offsite facility for storage. In the event of a disaster, including the loss of source content on any of the client or server systems, the tapes maintained offsite may be sent back to main facility and quickly restored without the aid of additional software as discussed above with respect to FIG. 7.

As shown in FIG. 13, a tape device may also be maintained offsite. The content data may be sent to a remote server via the Wide Area Network and then written to tape as discussed above with respect to FIGS. 1-6. In some embodiments, WAN acceleration techniques may be used to improve data transfer efficiency. WAN acceleration techniques may include, for example, the use of parallel data streams and/or buffer resizing. The tapes (e.g., 1360 a. 1360 b) created may be maintained at an offsite facility. Accordingly, in the event of a disaster, including the loss of source content on any of the client or server systems, the tapes maintained offsite may be sent back to main facility and quickly restored without the aid of additional software as discussed above with respect to FIG. 7.

While various embodiments of the present invention have been described above, it should be understood that they have been presented by way of example, and not limitation. It will be apparent to persons skilled in the relevant art that various changes in detail can be made therein without departing from the spirit and scope of the invention. Thus, the present invention should not be limited by any of the above-described exemplary embodiments, but should be defined only in accordance with the claims supported by this disclosure and their insubstantial variations. 

1. A method, performed by a computer, for storing information on tape, comprising: receiving stored files and information of a modification time of each of the stored files; commanding writing of the stored files to tape; writing a proxy file for each of the stored files to memory, each proxy file including an indication of the modification time of the corresponding stored file; and commanding writing of the proxy files to tape.
 2. The method of claim 1, wherein at least some of the stored files include video information, and wherein proxy files for the stored files including video information include a version of the video information at a lower resolution than the resolution of the video information in the stored files.
 3. The method of claim 1, wherein in the proxy files are stored in a directory structure matching a directory structure for the received stored files.
 4. The method of claim 1, wherein the proxy files are stored in a directory structure matching a directory structure for the stored files as written to tape.
 5. The method of claim 1, further comprising writing a sequence to memory, the sequence indicating order in which stored files were written to tape, and commanding writing of the sequence to tape.
 6. The method of claim 1, further comprising: determining whether a stored file has a modification time later than a modification time indicated by a proxy file for the stored file; and commanding writing of the stored file to tape if the stored file has a modification time later than a modification time indicated by the proxy file for the stored file.
 7. A system for data management, comprising: a processor to receive metadata related to source content and extract information from the metadata to determine a data structure for storage of the source content; a server, coupled to data manager via a network connection, to receive from the processor the metadata and the data structure for storage of the source content and to receive the source content from a client and to execute instructions to cause the content data to be written to a formatted tape media of a tape device coupled thereto.
 8. The system of claim 7, wherein the source content is stored in a file system on the formatted tape media.
 9. The system of claim 8, wherein the file system is a linear tape file system.
 10. A computer implemented method for managing data, comprising: receiving via a server, content data and metadata information, the content data being directly provided from a content data source; initializing a proxy file system; executing instructions to cause the content data, write sequence information and the proxy file system to be sequentially written to a formatted tape media.
 11. The computer implemented method of claim 10, wherein the tape media is formatted to provide a linear tape file system.
 12. The computer implemented method of claim 10, wherein the content data source is a client coupled to the server via a network.
 13. The computer implemented method of claim 10, wherein the content data source is a storage area network coupled to the server.
 14. A computer implemented method for restoring content data from a tape, the method comprising: retrieving, from a formatted tape including content data, a binary file, the binary file including instructions which when executed cause a processor to: read directory structure information from the tape; and restore the content data included on the tape to a predetermined target location.
 15. The computer implemented method of claim 14, wherein the tape is an Linear Tape Open version 5 (LTO-5) tape.
 16. The computer implemented method of claim 14, wherein the tape is formatted with a Linear Tape File System (LTFS).
 17. The computer implemented method of claim 14, wherein the directory structure information includes an archive history.
 18. The computer implemented method of claim 14, wherein the directory structure information includes proxy file system information.
 19. A computer implemented method for self-verification of content data on a tape media, the method comprising: retrieving, from a formatted tape including content data, a binary file, the binary file including instructions which when executed cause a processor to: perform a checksum on the content data on the tape; compare the checksum with a correct checksum stored on the tape; provide an indication of a data verification result.
 20. The computer implemented method of claim 19, wherein the indication of a data verification result includes displaying on a display device, a message indicating a checksum error when the checksum is not equal to the correct checksum stored on the tape.
 21. A computer implemented method for generating a preview of content data stored on a tape storage medium, the method comprising: receiving content data from a source to be written to a tape storage medium; extracting metadata from the content data and using the metadata to generate a proxy data structure; copying a file included in the source content data and storing the copied file in the proxy data structure; writing the source content data to the tape storage medium; storing the proxy data structure in a storage device of a production station; accessing, via the production station, a file of the proxy data structure to generate a preview of a corresponding source file of the content data written to tape.
 22. The computer implemented method of claim 21, wherein the source file includes an image and the preview includes a display of a lower resolution version of the image.
 23. The computer implemented method of claim 21, wherein the production station may further access the proxy data structure to generate a listing of the files included in the source content data.
 24. The computer implemented method of claim 23, wherein the listing includes a modification time for a file included in the source content.
 25. The computer implemented method of claim 23, wherein the listing includes a size of a file included in the source content. 